Exploration and Exploitation During Sequential Search
نویسندگان
چکیده
منابع مشابه
Exploration and Exploitation During Sequential Search
When we learn how to throw darts we adjust how we throw based on where the darts stick. Much of skill learning is computationally similar in that we learn using feedback obtained after the completion of individual actions. We can formalize such tasks as a search problem; among the set of all possible actions, find the action that leads to the highest reward. In such cases our actions have two o...
متن کاملExploration and exploitation during information search and consequential choice
Before making a choice we often search and explore the options available. For example, we try clothes on before selecting the one to buy and we search for career options before deciding a career to pursue. Although the exploration process, where one is free to sample available options is pervasive, we know little about how and why humans explore an environment before making choices. This resear...
متن کاملSearch in patchy media: Exploitation-exploration tradeoff.
How to best exploit patchy resources? We introduce a minimal exploitation-migration model that incorporates the coupling between a searcher's trajectory, modeled by a random walk, and ensuing depletion of the environment by the searcher's consumption of resources. The searcher also migrates to a new patch when it takes S consecutive steps without finding resources. We compute the distribution o...
متن کاملTowards Effective Exploration/Exploitation in Sequential Music Recommendation
Music streaming companies collectively serve billions of songs per day. Radio-based music services may intersperse audio advertisements among the songs as a means to generate revenue, much like traditional FM radio. Regardless of the monetization approach, the recommender system should decide when to play content that the listener is known to enjoy (exploit) and content that is novel to the lis...
متن کاملSponsored Search and the Exploration-Exploitation-Outsourcing Dilemma
Motivated by the recently proposed (and eventually abandoned) partnership in sponsored search advertising between Yahoo! and Google, we examine a model for ad selection in which an agent must balance directly learning which ads perform best and outsourcing ad slots to another party in exchange for a share of any click revenue. Our framework captures a fundamental tension between the potential s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Cognitive Science
سال: 2009
ISSN: 0364-0213,1551-6709
DOI: 10.1111/j.1551-6709.2009.01021.x